release/v0.61.1 #2895
Conversation
GitHub CI does not appear to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account. Have you already signed the CLA but the status is still pending? Let us recheck it.
Pull Request Overview
This PR bumps the version from 0.60.2 to 0.61.1 and introduces significant enhancements to the evaluation system, particularly around custom evaluations, evaluator revisions, and metric handling. The changes span the web frontend (both the OSS and EE editions) and the Python SDK.
Key changes include:
- Introduction of a "custom" evaluation type throughout the codebase, alongside the existing "auto", "human", and "online" types
- Implementation of evaluator revision fetching and merging logic to support version-based evaluator definitions (see the first sketch after this list)
- Enhanced metric column factory with improved slug resolution, nested metric support, and better type inference from statistics
- SDK workflow improvements making slug parameters optional for built-in evaluators
- New SDK model structures for evaluations, testsets, and git-based artifact management
- Improved metric key normalization and fallback resolution in the focus drawer (see the second sketch after this list)
- CSV export functionality for custom evaluations (see the third sketch after this list)
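
As a rough illustration of the revision-merging idea, the sketch below overlays a revision's versioned fields on top of the base evaluator definition. The shapes `EvaluatorDto` and `EvaluatorRevisionDto` and the helper `mergeEvaluatorWithRevision` are assumptions for this sketch, not the actual API in `web/oss/src/state/evaluators/atoms.ts`.

```typescript
// Hypothetical DTO shapes; the real types live in web/oss/src/lib/hooks/useEvaluators/types.ts.
interface EvaluatorDto {
    id: string
    slug: string
    name: string
    settings?: Record<string, unknown>
}

interface EvaluatorRevisionDto {
    evaluatorId: string
    version: number
    name?: string
    settings?: Record<string, unknown>
}

// Sketch: overlay the revision's versioned fields onto the base evaluator,
// keeping base values wherever the revision does not define them.
function mergeEvaluatorWithRevision(
    base: EvaluatorDto,
    revision?: EvaluatorRevisionDto,
): EvaluatorDto {
    if (!revision) return base
    return {
        ...base,
        name: revision.name ?? base.name,
        settings: {...(base.settings ?? {}), ...(revision.settings ?? {})},
    }
}
```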
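For the metric-key normalization and fallback resolution point, a minimal sketch of the general approach is shown below: normalize the raw key, then try progressively less specific candidates. The helper names, the normalization rule, and the candidate order are assumptions, not the focus drawer's actual code.

```typescript
// Sketch: normalize a metric key into a dotted, lowercase form.
const normalizeMetricKey = (key: string): string =>
    key.trim().toLowerCase().replace(/[\s/]+/g, ".")

// Sketch: resolve a metric value by trying the evaluator-qualified key first,
// then the normalized key, then its last path segment.
function resolveMetricValue(
    metrics: Record<string, number | undefined>,
    rawKey: string,
    evaluatorSlug?: string,
): number | undefined {
    const key = normalizeMetricKey(rawKey)
    const candidates = [
        evaluatorSlug ? `${evaluatorSlug}.${key}` : undefined,
        key,
        key.split(".").pop(),
    ].filter((k): k is string => Boolean(k))
    for (const candidate of candidates) {
        if (metrics[candidate] !== undefined) return metrics[candidate]
    }
    return undefined
}
```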
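For the CSV export of custom evaluations, the following is a loose sketch of flattening run rows into CSV text. The `EvalRunRow` shape, the column names, and the `toCsv` helper are made up for illustration and do not reflect the PR's actual implementation.

```typescript
// Hypothetical row shape for illustration only.
interface EvalRunRow {
    testcaseId: string
    metrics: Record<string, string | number | null>
}

// Sketch: flatten rows into a CSV string, quoting values that contain commas,
// quotes, or newlines.
function toCsv(rows: EvalRunRow[], metricKeys: string[]): string {
    const escape = (value: unknown): string => {
        const text = value === null || value === undefined ? "" : String(value)
        return /[",\n]/.test(text) ? `"${text.replace(/"/g, '""')}"` : text
    }
    const header = ["testcase_id", ...metricKeys].map(escape).join(",")
    const lines = rows.map((row) =>
        [row.testcaseId, ...metricKeys.map((k) => row.metrics[k])].map(escape).join(","),
    )
    return [header, ...lines].join("\n")
}
```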
Reviewed Changes
Copilot reviewed 113 out of 118 changed files in this pull request and generated 55 comments.
| File | Description |
|---|---|
| web/package.json | Version bump to 0.61.1 |
| web/oss/package.json | Version bump to 0.61.1 |
| web/ee/package.json | Version bump to 0.61.1 |
| web/oss/src/state/evaluators/atoms.ts | Added evaluator revision fetching and merging logic with new utility functions |
| web/oss/src/state/app/hooks.ts | Enhanced app filtering logic to exclude SDK evaluation apps |
| web/oss/src/state/app/atoms/fetcher.ts | Added filtering and new app detail query atom |
| web/oss/src/lib/hooks/useEvaluators/types.ts | Added EvaluatorRevisionDto types |
| web/oss/src/lib/hooks/useEvaluators/index.ts | Removed unused rest parameter |
| web/oss/src/lib/Types.ts | Fixed semicolon formatting |
| web/ee/src/lib/metricColumnFactory.tsx | Major refactor with nested metric support, improved slug resolution, and type inference from statistics (see the sketch after this table) |
| web/ee/src/components/pages/evaluations/* | Added "custom" evaluation type support throughout |
| web/ee/src/components/HumanEvaluations/assets/utils.tsx | Enhanced metric collection and evaluator slug resolution |
| web/ee/src/components/EvalRunDetails/* | Updated to support custom evaluation type |
| sdk/pyproject.toml | Version bump and dependency updates |
| sdk/agenta/sdk/workflows/utils.py | Renamed PARAMETERS_REGISTRY to CONFIGURATION_REGISTRY |
| sdk/agenta/sdk/workflows/builtin.py | Made slug parameter optional for all built-in workflows |
| sdk/agenta/sdk/utils/references.py | New utility for slug generation |
| sdk/agenta/sdk/utils/client.py | New authenticated API client utility |
| sdk/agenta/sdk/models/* | New model files for workflows, evaluations, testsets, git, and blobs |
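
For the type-inference change in `web/ee/src/lib/metricColumnFactory.tsx`, one plausible rule, shown purely as an illustration, is to look at which aggregated statistics a metric carries and pick a column type accordingly. The statistic field names and the `inferColumnType` helper below are assumptions, not the factory's real code.

```typescript
type MetricColumnType = "numeric" | "boolean" | "text"

// Hypothetical statistics shape; the real field names may differ.
interface MetricStats {
    mean?: number
    min?: number
    max?: number
    trueCount?: number
    falseCount?: number
}

// Sketch: infer a column type from whichever statistics are present,
// falling back to a plain text column.
function inferColumnType(stats?: MetricStats): MetricColumnType {
    if (!stats) return "text"
    if (stats.trueCount !== undefined || stats.falseCount !== undefined) return "boolean"
    if (stats.mean !== undefined || stats.min !== undefined || stats.max !== undefined) {
        return "numeric"
    }
    return "text"
}
```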